Speech coding using trajectory compression and multiple sensors
Abstract
This paper presents a new method of multi-frame speech coding based on polynomial approximation of speech feature trajectories, incorporating signals from multiple sensors: microphones, an accelerometer, an electroglottograph, and a microradar. The polynomial approximation exploits the inter-frame redundancy found in natural speech and applies to features such as spectral parameters, gain, and pitch. The method can be embedded in a frame-based vocoder to further reduce the transmission bit rate, while the additional transducers increase the intelligibility and quality of the coded speech in noisy environments. Experimental results are obtained by embedding the new method in an enhanced mixed-excitation linear prediction vocoder. The resulting vocoder operates at 1533 bps, and preliminary intelligibility and quality tests show results comparable to those of the original 2400 bps vocoder.
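The trajectory idea in the abstract can be illustrated with a minimal sketch: rather than sending one feature value per frame, a block of frames is summarized by a few polynomial coefficients. Everything specific here is an assumption made for illustration and is not taken from the paper: the block length of 8 frames, the polynomial order of 2, the pitch values, and the function names `fit_polynomial` and `reconstruct`. Quantization of the coefficients is left out.

```python
"""Sketch: summarize one feature trajectory with a least-squares polynomial."""


def fit_polynomial(values, order):
    """Return coefficients c[0..order] with values[t] ~ sum(c[k] * x**k).

    x is the frame index rescaled to [-1, 1], which keeps the normal
    equations reasonably well conditioned for small orders.
    """
    n = len(values)
    xs = [2.0 * t / (n - 1) - 1.0 for t in range(n)]
    m = order + 1
    # Augmented normal equations: (A^T A | A^T y).
    aug = [
        [sum(x ** (i + j) for x in xs) for j in range(m)]
        + [sum((x ** i) * y for x, y in zip(xs, values))]
        for i in range(m)
    ]
    # Gaussian elimination with partial pivoting.
    for col in range(m):
        pivot = max(range(col, m), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(col + 1, m):
            factor = aug[row][col] / aug[col][col]
            for k in range(col, m + 1):
                aug[row][k] -= factor * aug[col][k]
    # Back substitution.
    coeffs = [0.0] * m
    for i in range(m - 1, -1, -1):
        tail = sum(aug[i][j] * coeffs[j] for j in range(i + 1, m))
        coeffs[i] = (aug[i][m] - tail) / aug[i][i]
    return coeffs


def reconstruct(coeffs, n_frames):
    """Evaluate the fitted polynomial at every frame of the block."""
    xs = [2.0 * t / (n_frames - 1) - 1.0 for t in range(n_frames)]
    return [sum(c * x ** k for k, c in enumerate(coeffs)) for x in xs]


# Hypothetical pitch trajectory (Hz) over a block of 8 frames.
pitch = [118.0, 120.5, 123.0, 126.0, 128.5, 130.0, 130.5, 130.0]
coeffs = fit_polynomial(pitch, order=2)      # 3 numbers instead of 8
approx = reconstruct(coeffs, len(pitch))     # decoder-side trajectory
max_error = max(abs(a - b) for a, b in zip(pitch, approx))
print(len(pitch), "values ->", len(coeffs), "coefficients; max error", max_error)
```

In this sketch the saving comes only from sending fewer numbers per block. A real coder would also have to choose the block length, the order per feature, and a quantizer for the coefficients, none of which the abstract specifies.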
Similar resources
Real-Time Multiple-Description Coding of Speech Signals
When sending speech data over lossy networks like the internet, multiple-description (MD) coding is a means to improve the perceived quality by dividing the data into multiple descriptions which are then sent as separate packets. In doing so the speech signal can still be decoded even if only parts of these descriptions are received. The present paper describes the structure of a software which...
Comparing several models for perceptual long-term modeling of amplitude and phase trajectories of sinusoidal speech
The so-called Long-Term (LT) modeling of sinusoidal parameters, proposed in previous papers, consists in modeling the entire time-trajectory of amplitude and phase parameters over large sections of voiced speech. It differs from the usual Short-Term models, which are defined on a frame-by-frame basis. In the present paper, we focus on a specific novel contribution to this general framework: the compa...
Digital Audio: from Lossless to Transparent Coding
We have seen rapid progress in high-quality compression of wideband audio signals. Today’s coding algorithms can achieve substantially more compression than was thought possible only a few years ago. In the case of audio coding with its bandwidth of 20 kHz and more, the concept of perceptual coding has paved the way for significant bit rate reductions. However, multiple codings can reveal origi...
Lossless and Perceptual Coding of Digital Audio
We have seen rapid progress in high-quality compression of wideband audio signals. Today’s coding algorithms can achieve substantially better compression than was thought possible only a few years ago. In the case of audio coding with its bandwidth of 20 kHz and more, the concept of perceptual coding has paved the way for significant bit rate reductions. However, multiple coding can reveal orig...
Speech compression a novel method pdf
Text summarization is a process that reduces the size of the text document. Purpose, we use part-of-speech tagging to recognize types of the text words. speech compression applications Compression rate is a scale to decrease the size of text summary. speech compression abstract A higher. This paper illustrates a novel method of speech compression and transmission. This method saves the transmiss...
Publication date: 2004